Sequence errors described in GenBank: a means to determine the accuracy of DNA sequence interpretation.
Abstract
The accuracy of nucleic acid sequence data interpretation was determined by assessing and quantifying the discrepancies reported in the GenBank database. This permitted the calculation of an Error Rate (ER) for nucleic acid sequence determination. If one assumes that most entries (TB, Total Bases) were independently verified, or that those without reported discrepancies were correct, the ER is 0.368 errors per 1000 bases. However, if one assumes that only those sequences with reported discrepancies (TBIQ, Total Bases from entries In Question) were verified and are thus correct, the ER is 2.887 errors per 1000 bases. These two values establish the first lower and upper limits of the ER for sequence interpretation and sequence errors within the GenBank database, and they provide the foundation for future assessments and for monitoring sequence data accumulation. In addition, the ER measure provides a basis for evaluating the efficiency and merit of present and future automated nucleic acid sequencing technologies, which will have a direct impact upon the final outcome of the "Human Genome Initiative".
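The two limits can be related with a short calculation. The sketch below is illustrative only: the abstract does not give the raw error or base counts, so the function names and the example counts are hypothetical. It also assumes that both rates divide the same number of reported errors by different denominators (TB and TBIQ), which the abstract implies but does not state outright.

```python
def error_rate_per_kb(errors: int, bases: int) -> float:
    """Return the number of errors per 1000 bases."""
    if bases <= 0:
        raise ValueError("bases must be a positive count")
    return 1000.0 * errors / bases


def implied_base_fraction(er_lower: float, er_upper: float) -> float:
    """Fraction TBIQ / TB implied by two rates that share one error count.

    Assumption: ER_lower = 1000 * E / TB and ER_upper = 1000 * E / TBIQ,
    so TBIQ / TB = ER_lower / ER_upper.
    """
    return er_lower / er_upper


# Limits quoted in the abstract (errors per 1000 bases).
ER_LOWER = 0.368
ER_UPPER = 2.887

# Hypothetical counts, chosen only to exercise the function.
example_rate = error_rate_per_kb(errors=100, bases=250_000)

fraction_in_question = implied_base_fraction(ER_LOWER, ER_UPPER)
print(f"example rate: {example_rate:.3f} errors per 1000 bases")
print(f"implied TBIQ / TB: {fraction_in_question:.3f}")
```

Under the shared-error-count assumption, the ratio of the two quoted limits suggests that entries with reported discrepancies account for roughly one eighth of all bases considered.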
Journal: Nucleic Acids Research
Volume 17, Issue 10
Publication date: 1989